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Abstract 

Identifying and preserving archaeological sites before they are des- 
troyed is a very important issue. In this paper, we develop a greatly 
improved archaeological predictive model ^4-PM eri /j ance ^ that predicts 
where archaeological sites will be found. This approach is applied to 
remotely-sensed multispectral bands and a single topographical band 
obtained from advanced remote sensing technologies such as satellites 
and Airborne Laser Scanning (ALS). Our APM en ^ nce( j is composed 
of band transformation, image analysis, feature extraction and clas- 
sification. We evaluate our methodology on the sensor bands over 
Ft. Irwin, CA, USA. A nested bi-loop cross-validation and receiver 
operating characteristics curves are used to assess the performance of 
our algorithm. We first validate our method on the east swath of Ft. 



Irwin and then test on a separate dataset from the west swath of Ft. 
Irwin. A convex combination of two methodologies: APM conven ti on ah 
which has been used among archaeologists for many years, and our 
APM enhanced, is demonstrated to yield superior classification perform- 
ance compared to either alone at low false negative rates. We compare 
the performance of our methodology on different band combinations, 
chosen based on the archaeological importance for these sensor bands. 
We also compare the two types of APMs in the aspects of input data, 
output values, practicality and transferability. 



1 Introduction 

Archaeological sites are uniquely important sources of knowledge relevant to 
history, culture and nature. They convey messages from the past and embed 
the link between people of this modern world with people who inhabited 
the same areas hundreds or thousands of years ago. From a pure scientific 
standpoint, these sites are extremely valuable for studying the patterns of bio- 
logical variations among humans and their ancestors, human achievements 
and linguistic origins. Protecting archaeological sites means guarding the 
archaeological heritage to which human beings are entitled. However, ar- 
chaeological sites are being destroyed at an incredible rate. They are faced 
with many threats such as road constructions, developments in urban and 



rural areas, mining, and agriculture (Chokhani 1979). Growing populations 



are crowding into areas not previously occupied by human beings. Once des- 
troyed, an archaeological site cannot be excavated. It is lost forever because 
archaeological research depends upon finding uncontaminated material in ori- 
ginal context. Sites are often destroyed before humankind becomes aware of 
their presence. Identification and preservation of archaeological sites against 
damage and destruction is remarkably important. 

Traditional archaeological predictive models (AP M conventiona i) are inten- 
ded to identify regions within which sites are likely to be found and deal 
only with how suitable a region is for a certain activity rather than actu- 
ally finding sites. Development of APM conven u ona i began approximately 50 



years ago (Willey 1953). It is an "inductive" method derived from obser- 
vations rather than from theory; for instance the environmental factors in 
APM conventiona i are selected by archaeologists based on experience and ex- 
pertise. It is a region-based predictive approach that incorporates factors 
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including slope, vegetation cover, proximity to water, elevation and so on. 
Some APM conventiona i models may be formed from a successful but informal 
hypothesis and later examined under simple statistical procedures. Often, 
archaeologists delineate regions by testing for frequency of sites against a 
random distribution. 

A mathematical formulation of a traditional APM is given by a function 



APM convent ional '■ — > R (1) 
AP ' conventional ■ 8 I ^ t. (2) 

The input s = (x, y) 6 {1, 2, w} x {1, 2, h} G M 2 is the geographic loca- 
tion of a site on an M 2 map of pixels, where we can consider x as the row num- 
ber and y as the column number in the map. The output t gives discretized 
and usually integer scores in K for regions, where higher scores suggest high 
possibility of archaeological regions and lower scores suggest regions where 
sites are less likely to be found. Fig [I] is an example of APM conventiona i for ar- 
chaeological habitation sites in Ft. Irwin. Regions where sites are most likely 
to be found are dark green, regions where sites are least likely to be found 
are red, and regions of intermediate likelihood are color coded accordingly. 

Because geological characteristics vary among regions, APM con „ en4iona / 
for different maps can have different influencing factors. Transferability of 
APM conventiona i is limited in the sense that archaeologists would always need 
to survey the field in a new region that depends on different archaeological 
factors. This is also very costly and time-consuming. Advanced technologies 
such as airborne and spaceborne satellites make possible the use of sensor 
bands to directly detect archaeological sites ( Comer and Bloni||2007a ). Ana- 
lysis of these sensor bands is one effective solution for site identification be- 
cause localized environmental changes from long ago have persisted to the 
present due to frequent human activities in the past. We intend to construct 



an enhanced APM en h an ced using sensor bands (Chen et al. 2013 Comer and 



Blom 2007b) with high transferability and accuracy. Such APMs can tre- 



mendously benefit the progress of preserving archaeological sites and save 
the cost and time of on-site investigation. Moreover, it is desirable to build 
enhanced APMs that yield low false negative rates. Failing to detect sites 
before construction projects begin, for example, can produce cost overruns 
when projects are delayed, and might even necessitate abandoning the pro- 
ject. 
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Figure 1: An example of AP M conventiona i for Ft. Irwin sites. The traditional 
APM has identified regions with which archaeological sites are more or less 
likely to be found. In the APMs for Fort Irwin, dark green areas are regions 
where sites are likely to be found, red areas where sites are least likely, and 
colors in between ranked accordingly. 



In this article, we develop an improved archaeological predictive model 



APM enhanced :R 2 ^R (3) 

APM enhanced :s*->t (4) 

to classify archaeological sites where 1 denotes a site of archaeological signi- 
ficance and denotes otherwise. 

The original dataset has two swaths (east and west) of Ft. Irwin mil- 
itary reservations, CA, USA. The dataset for each swath consists of eight 
multispectral bands from the WorldView-2 satellite and one slope band from 
Airborne Laser Scanning (ALS) ( Pratchett|2000 ) data. Based on the original 



nine bands, we construct four additional tassel cap (KTT) bands ( Yarbrough 



et al. 2005 ) and one normalized difference vegetation index (NDVI) band. We 
intend to classify archaeological sites from non-sites. 

We first train our method on 36 band difference ratios obtained from the 
eight multispetral bands and one slope band from east Ft. Irwin. To validate 
APM en hanced on the training set, we use a nested leave-one-out cross valida- 
tion where the inner loop identifies the best PCA dimension, and the outer 
loop uses the selected PCA dimension to classify the left-out sample. We 
assess the performance on the eastern portion of Ft. Irwin via the receiver 
operating characteristic (ROC) curve. We particularly use the area under 
the curve (AUC) as a summary statistic to evaluate the performance of our 



approach (Bradley 1997 1 1 Our methodology on the training set achieves 
both high accuracy and high true negative rate for low false negative rate 
for archaeological site classifications. We then test our method on the west 
swath of Ft. Irwin. We compare APM enhanced with APM conventional and find 
that APM en hanced outperforms the APM conventional in both training and test- 
ing sets, especially for low false negative rates. We also consider a convex 
combination model: 



APM^ = (1 - l)APM conV enUonal + ^fAP M enh anced (5) 

and show that for the eastern region, APM^ outperforms both APM conV entionai 
and APMenhanced- This is especially significant because the APM con ventionai 
has been evaluated as being high-performing as measured by the stand- 
ard metric generated for this purpose by archaeologists, the "gain statistic" 

1 We are aware of the controversies of using this statistic and use this evaluation with 
caution. 
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(Kvamme 1988). Then we use our method as a protocol for model selection 
by comparing the classification accuracies of APMs using 36, 45, 66, and 78 
band difference ratios. The comparison analysis of band combinations not 
only demonstrates the bands' levels of archaeological relevance to classifica- 
tion, but also demonstrates that a wise choice of features can help improve 
prediction accuracy. 

The remainder of the article is organized as follows. In Section 2, we 
describe the data set and present APM en hanced algorithm in detail. Section 
3 shows the results of training and testing on east and west swaths of Ft. 
Irwin, and compares the performance of APM conventionah APM enhanced and 
the convex model APM^. In Section 4, we apply APM enhanced to different 
dimensions of band difference ratios and summarize a comparison between 
APM ' conven tionai and APM en hanced ■ These modelling approaches have great 
applicability to both the management and preservation of archaeological sites 
and archaeological research ( Comer||2013 2011 Menze and Ur||201~2~ ) . In this 
article, we demonstrate that a clever architecture, which combines machine 
learning technique and human expertise, yields improved performance. 



2 The WorldView-2 and the ALS Data 



The data in our analysis has nine remotely sensed bands ( |Comer|2006 ). Eight 



of them are obtained from the WorldView-2 satellite^] imagery with bands: 
coastal blue, blue, green, yellow, red, red edge, near infrared I, near infrared 
II. An image of the coastal blue band is seen in Fig [2]. The ninth band: 
slope, is calculated from the Airborne Laser Scanning (ALS) data collected by 
Department of Defense sensors. Our study analyzes two swaths of territory 
where the first swath covering a large part of the eastern portion of Ft. Irwin, 
collected on Sep 22, 2011, is used as a training set and the second swath 
covering the western portion of Ft. Irwin, collected on Dec 30, 2010, is used 
as a testing set. These 8 spectral bands were orthorectified to 2-meter ground 
resolution with 11- bit radiometric resolution (stored as 16-bit integers). We 



2 WorldView-2 is the second next-generation high-resolution satellite of DigitalGlobe, 
Inc., Longmont, CO, USA (http://www.digitalgl obe.coml ). The satellite has eight spectral 
sensors in the near infra-red range. 

3 There are fourteen swaths of WorldView-2 satellite imagery data covering the entire 
land area of China Lake and Ft. Irwin military reservations in California. 
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Figure 2: Image of Coastal Blue Sensor Band 



form an eight spectral band data set with 9859 columns and 61098 rows^ 
All locations outside of the Ft. Irwin land area boundary were masked out, 
as shown in Fig |3j. 



The slope data used in this study is derived from ALS data (Pratchett 



2000) collected by the Department of Defense. It consists of images created 
from the data orthorectified to 2-meter ground resolution. The data ranges 
in integer values from to 86 degrees, where 90 degrees indicates a sheer cliff. 
The slopes for the east Ft. Irwin and west Ft. Irwin have different sizes from 
the WorldView-2 multispectral bands. Thus we crop the multispectral bands 
data to a 9859-column-by-23000-row subset data that covered the grounds 
of a western portion of Ft. Irwin for the testing set, and a 9002-column-by- 
15780-row subset of the eastern swath was used as the training set so that 
all the sensor bands have a consistent size. Although the slope band and 
multispectral bands are generated from different devices, we made sure they 
are registered for the correct site locations. 

We also consider four additional tassel cap bands (KTT): brightness, 
greenness, wetness and the fourth KTT band (Yarb rough et al. |2005 ), all of 



4 DigitalGlobe provided the swath of the western Ft. Irwin data in 45 separate 4096 x 
4096 pixel blocks of data originally. 
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Figure 3: The east and west swathes of Ft. Irwin. 



Table 1: WorldView-2 spectral bands. 



Band 


Lower and Upper Band 


Center Wavelength 




Edge (nm) 


(nm) 


Coastal Blue 


396 and 458 


427 


Blue 


442 and 515 


478 


Green 


506 and 586 


546 


Yellow 


584 and 632 


608 


Red 


624 and 694 


659 


Red Edge 


699 and 749 


724 


Near Infrared 1 


765 and 901 


833 


Near Infrared 2 


856 and 1043 


949 
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Figure 4: Site locations in eastern Ft. Irwin swath. The light blue gridded 
rectangular patch is the subset of the original data used in our analysis. The 
data is 2-meter resolution, which means there is a point every 2 metres. For 
Ft. Irwin swaths, there are too many points to process into surface models 
like elevation and slope so the data is distributed in grids as shown in this 
figure. 

which were linear combinations of the spectral bands. In Section 4, we discuss 
four types of band combinations that include the KTT and NDVI bands, and 
compare the performances of APM en h ance( i on these combinations. 

3 The Enhanced Archaeological Predictive Model 

Our method APM en h an ced is a composition of four functions h o g o / o V, 
where 



T> : M 2 — > M 2 , as band transformation 
/ : R 2 — > M. d , as image processing 
g : R d -> R d with d < d, as feature extraction 
h : J$L d — > [0, 1], as classification 



(6) 
(7) 
(8) 
(9) 
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3.1 The Band Difference Ratio Applied to Data Bands 

Each site s = (x,y) is a point in R 2 , where we can consider x and y denote 
the row and column locations respectively in a map of pixels. A feature 
image where 1 < i < B, defined on IR 2 is a random image which maps 
each site location to an intensity value peR: 

B {i) : R 2 -> M,for 1 < i < B (10) 

A multi-band image is a 5-tuple of bands (£ (1) , B {2 \ . . . , B (S) ) G R^ . Since 
the multispectral bands and the slope band were collected at different times of 
the year, in order to gain robustness against the changes in lighting conditions 
at different times of data collection, the data is transformed via the band 



difference ratio (Marchisio et al. 2010) as band transformation 



V : R 2 R 2 (11) 

V : (x, y; jgW . . . , h> (x, y; B« . . . , B (S) ) (12) 

where 

_ «(i) / r\ 

B {k) = " ^— , for all i > j, and 1< Jfc < . (13) 

BW+B®' ~ -\2J 1 ; 

Our result in Section 4 is based on usine; B = 9 and B = Q = 36 
band difference ratios for our data analysis. In Section 4.3, we compare the 
performances using other dimensions of band difference ratios. Note that we 
can also use the identity function as a band transformation or a combination 
of band difference ratios and identity function. 

3.2 Annuli Method in Image Processing 

The image processor ( Schowengerdt 2006[ ) is a function 



/ : R 2 -> R d , (14) 
f:(x, y] B^,...,B^)^w (15) 
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that takes a location and B image bands and returns a vector. This function 
returns local statistics about the image for each location. We suppose that 



f(x,y;BV,...,BW) 



{f B (x,y;B^)J 



V(x,y),BU,...,BW 



(16) 

where each component : M. 2 — >■ M. d ^ B depends only on band b for each 
b G {1, . . . , B}. Further we assume that all the functions /& are identical, so 
that the same statistics are computed for each image. 

In our study, the sensor bands reflect differently around the archaeological 
sites, seen in Fig j5j. To capture the differentiating characteristics, we use a 
particular imaging processing method based on statistics of pixels in annuli 
centered at each site and compute two robust statistics. Let 



A s {r {m \r {out) ) = {s' G R 2 : r {in) < 



< r 



(out) 



} 



(17) 



be the set of coordinates in the annulus centered at s G R 2 with inner radius 



,(m) anc j ou t er radius r^ out \ We define 



v : 



Vi(s,B) = median{£v : s' G A s (rf n , r\ ou ')} 



Si(s, B) = MAD{B S ' : s' G A s {r { : n \r ( r t] )} 

= median{|i3 s / - Ui(s)\ : s' G A s (rf n \ r-° Mt) )} 



30 and 5 : M 2 ^ IR 30 by 

(18) 

(19) 



for i G {1,2,..., 30} where MAD denotes the sample median absolute devi- 
ation. 

The inner and outer radii ranges we consider are given in this table: 



i 


1 


2 


3 .. 


. 10 


11 


12 


13 .. 


. 20 


21 


22 


23 . 


.. 30 


r (in) 





3 


6 .. 


. 27 





5 


10 .. 


. 45 





7 


14 . 


.. 63 




2 


5 


8 .. 


. 29 


4 


9 


14 . . 


. 49 


6 


13 


20 . 


.. 69 



Since the annuli overlap and cover a relatively large area, the statistics for 
these ranges will likely contain some redundant information so the feature 
extraction step will be useful to reduce this redundancy while maintaining 
the components that discriminate between classes. 
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Multispectral Imaging 

Band 3-Red Band 

Band 4-Near Infrared Band 




Figure 5: At typical sites in our test area: There is a core of longer grasses 
in the middle of the site and further away from the center, vegetation is 
more sparse. Band 3, red, is almost completely absorbed by healthy, lush 
vegetation, while Band 4, near-infrared, is strongly reflected by healthy, lush 
vegetation. The bands reflections thus are different at different parts of the 
site, but even more different from the surrounding vegetation. This is the 
reason that we use the annuli approach. 



3.3 Feature Extraction and Classification 

Each location is now represented by a vector in R d (with d = 605). The 
dimension d is much larger than the sample size n. This often leads to "the 
curse of dimensionality" ( Duda et al.|2000 Tan et al.|2006 ), a phenomenon in 



which the available sample sizes are insufficient to build reasonable models of 
the high dimensional data. Such high dimensionality can also result in com- 
plicated computational issues and reduces classification accuracy. Feature 
extraction eliminates irrelevant features and reduces noise in the data set. 
Moreover it converts high dimensional feature space to a lower dimension 
that captures most of the variation of the original data. Mathematically, the 
feature extractor g is a function such that: 

g : R d -> R d , with d < d. (20) 

In this paper, principal components analysis (PCA) is used for feature 
extraction. The classifier h is a function 

/i:M d ^[0,l]. (21) 
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In our approach, g and h are very much intertwined in the sense that, h 
is applied in the procedure of choosing PCA dimension d, and the optimal 
d* is applied for the data before classification h is done. Further interaction 
between g and h is through the nested loops in leave-one-out cross validation. 
An algorithmic explanation of this procedure is presented in Algorithm 1. 

The classifier we choose is linear discriminant analysis (LDA). The pre- 
dicted class is given by y pre d = > r }> i- e > the s ^ e nas archaeological sig- 
nificance if the estimated posterior probability is greater than the threshold 
r G [0,1]. In Section 5, we discuss using another classifier (k, Z)-nearest 
neighbor rule (Devroye et al. 1996) and compare the PCA errors of both 
classifiers. 

Receiver operating characteristic (ROC) curves are parametrized by the 
the threshold r. For each value of r, we plot the percentage of false positives 
on the x-axis and the percentage of true positives on the y-axis. The area 
under the ROC curve (AUC) is a summary statistic which averages the true 
positive over all choices for the false positive rate. 



Algorithm 1 Use APM enhanced to classify archaeological sites 

1: Apply band difference ratio to multispectral band data 

2: Apply the annuli technique / and calculate the medians and MADs for 

each annulus 
3: for s = 1 : n tes t do 
4: for d = 1 : (ntegt ~ 1) do 
5: Apply h o g rf ( s ) on the training set 

6: end for 

7: Calculate misclassification error e<w s ) 
8: Pick = argmin { i t _ A} e d ( s) . 

9: Apply h o g d * to test s and get the posterior probability 
10: end for 



4 Results from the Ft. Irwin Site Research 

4.1 Assessing the Performance on Training Data set 

We analyze APM en h anced on the multi-band dataset. There are n\ = 37 
archaeological sites in Class 1. For Class data, we select n Q = 100 surveyed 
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East Sites as Training Set 




APM. enhanced 
APM.oIci 

Best APM. combo 
Half APM. combo 



0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 
False Positive Rate 

Figure 6: ROC for Training on Eastern Ft. Irwin Sites 



locations uniformly random [J and ensure these regions are far from known 
sites. Each site used in this analysis was examined by Cultural Site Research 
and Management (CSRM) under a contract with the Department of the 
Army to determine the eligibility of these for sites for inclusion in the National 
Register of Historic Places. This investigation was done under the direction 
of Douglas C. Comer, during 2011 and 2012. Precise locations were recorded 
with geographical positioning system (GPS) equipment, and site types were 
determined, and reviewed, by archaeologists who specialize in the Mojave 



Desert region. The APM for Ft. Irwin (Ruiz 2003) is specifically defined as 



APM, 



conventional 



{0,0.2,0.4,0.6,0.8,1} 



(22) 



where indicates the site is least likely to be archaeological and 1 the most. 
We further consider a convex combination APM^. For each value of 7 G [0, 1] , 



APM~ 



7) APM, 



conventional 



+ >yAPM enhanced . 



(23) 



Figure [6] shows the receiver operating characteristic (ROC) (Bradley 
1997) curve of lithic sites corresponding to four classifiers: APM conventiona i 



5 Surveyed locations are denned as locations where archaeologists surveyed but dis- 
covered nothing. Thus it is safe to assume they are non-sites. 
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AUG for APM. combo on Training Set 

0.88 - 
0.86 - 
0.84 

CD 
£Z 

a 0.8 ■ 

■5 

0.78 ■ 
0.76 - 
0.74 L 

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 
Gamma 

Figure 7: AUC for Training on Eastern Ft. Irwin Sites. Notice that there is 
a jump at 0. That is because APM en h an ced acts like a tiebreaker and rescales 
the discretized values from APM conven u on ai- 

(7 = 0), APM en hanced (7 = 1)> APM 1=0 , 5 , and the optimal combined model 
APM^ =1 *. Figure [7) shows how we choose the best 7 to get APM^ =1 * and 
7* = 0.3. For different choices of false positive percentage (or true positive 
percentage) we see that APM^ typically improves performance over using 
either the APM conventiona i and APM enhanced alone. The consistent superior 
performance of APM^ to APM conventiona i indicates that including our method 
increases prediction power over using the APM conventiona i alone. Also the 
false negative rate is lower in our enhanced model than in APM conven u ona i- 

4.2 Testing on the West of Ft. Irwin Dataset 

We train on the eastern sites using our method and test on the western 
Ft. Irwin sites. There are n\ = 49 archaeological sites and uq = 100 non- 
archaeological sites chosen at random from the western region surveyed map. 
We ensure that these non-sites are selected at least 200 meters away from 
the found and not-found sites. AP M en h an ced is trained on the eastern region 
and tested on the western region. 

Figure (8) shows that our method in general outperforms APM, except 
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West Sites as Testing Set 




Figure 8: ROC curve for testing on the western lithic sites 



AUC for APM.combci nn Testing Set 




Figure 9: AUC for testing on the western sites. The first jump is a discon- 
tinuity because APM en h an ced acts like a tiebreaker. The longer flat line in 
AUC from 7 G [0, 0.7] (compared to 7 G [0, 0.23] in Fig (7j) is due to the fact 
that the scores from testing have a smaller range. 
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from when false positive rates in [0, 0.23]. The convex combination model at 
the optimal 7 beats APM starting from false positive rate at 0.13, indicating 
that including our method improves classification power. We also see that 
our method achieves higher true positive rate at lower false negative rate than 
the APM from 0.13 to 1. Since we are more interested in achieving lower false 
negative rate, we mainly concerned the false positive rate ranging from 0.15 
to 1. Indeed, at these rates, our method outperforms the APM conventiona i. 
Figure [9] shows how the AUC of APM^ is varied for different choices of 7. 
We see that 7 closer to 1 has higher AUC, which indicates weighing more 
APM enhanced increases classification accuracy. 



5 Discussion 

In Section 3.1, we describe the band transformation function in APM enhanced 
and, in particular, use 36 band difference ratios. We also examine many 
sensor bands: eight multispectral bands, four KTTs, slope, one NDVI, where 
the four KTTs and one NDVI ( |Yarbrough eTaLl[2005l ) were not independent 



from, moreover generated by the eight multispectral bands. We consider 
cases of 15, 36, 66 and 78 band difference ratios, where the combinations 
are chosen by their archaeological importance and are demonstrated in Table 
[2] . We show that using the 36 dimensions produces the highest classification 



accuracy as in Fig 10 . Since the imaging technique extends each band to 
60 dimensions and with only above 100 training samples, higher dimensional 
features contain too much noise in the dataset while low dimension features 
do not capture enough information. Nevertheless, if given more training 
samples, the analysis using 66 dimensions may generate better results. 

In our classification scheme framework, we compare performances of LDA 



versus (k, /)-nearest neighbor rule (Devroye et al. 1996) given by: 



1 ifE^i^>i 
h(n)= { ifE"=i^< k-l (24) 
— 1 otherwise. 

We have empirical evidence that the PCA error of (k, /)-NN is lower than 
that of LDA, since the best PCA dimension is much smaller using (k, Z)-NN. 
However, by the bias- variance trade-off, the predicting performance of (k, l)- 
NN does not work as well as for LDA. However both classifiers still achieve 
higher classification accuracy than APM conventiona i. 
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Table 2: Compare Band Difference Ratio Dimensions 



Number ofBDR 


Names of bands 


Perform 


15 BDR 


Slope, NIR-1, red edge, brightness, 
greeness, wetness 


Worst 


36 BDR 


Slope, 8 WorldView-2 bands 


Best 


66 BDR 


Slope, 8 WorldView-2 bands, bright- 
ness, greenness, wetness 


Second best 


78 BDR 


Slope, 8 WorldVIew-2 bands, 4 KTT 
bands 


Third best 
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In our framework, the criteria for selecting PCA dimension d* is based on 
lowest error rate e. But we can also consider the ratio of the eigenvalues of 
the covariance matrix over the sum of all the eigenvalues. In that sense, we 
are choosing d* based on percentage of the variation of data and a common 
threshold is 95%. In this paper, the area under the ROC curve is used as a 
summarizing number for testing the performance of our algorithm. However, 
we can also consider using precision-recall as another measure to evaluate 
the performances. 

Our improved archaeological predictive methodology APM en h an ced, spe- 
cified by the four components: band transformation to data, image processor, 
feature extractor, and classifier, is applied to archaeological site discovery us- 
ing multispectral and topographical imagery data. AP M en h an ced is trained 
on the eastern region of Ft. Irwin and tested on the western region of Ft. 
Irwin. APM en hanced demonstrated not only lower error rate but also lower 
false negative rate. With low false negative rate, road constructions, or other 
activities that disturb the ground including military maneuvers have less 
chance building through or ruining an archaeological site nearby, thus saving 
tremendous costs. 

APM conventiona i, a standard means for identifying archaeological sites, in- 
puts features such as slope, vegetation, elevation and so on. These factors 
may not be preserved well and the input values are measured by archaeolo- 
gists investigating in the field. APM conventiona i predicts on low-dimensional 
data, usually less than 10. APM en hanced is a statistical learning approach that 
applies on multispectral band data acquired from advanced remote sensing 
technologies and predicts on high-dimensional data. 

APM conventiona [ outputs a few discretized values. The ability to adjust 
false negative rates is limited because the threshold r can only make a 
difference at few values. APM en h an ced, on the other hand, outputs much 
more values so that false negative rates can be adjusted lower. At the same 
threshold r, APM enhanced has lower false negative rate than APM conventional . 
APM conventional returns likelihoods of whether a region contains archaeolo- 
gical sites so sites within a region have the same APM coni , eritiona ;-probabilities. 
APM en hanced returns probabilities of single sites as to enhance the accuracies. 
APM CO n V entionai is a map-based approach because a different map may be in- 
fluenced by a different set of geological and environmental factors. Hence it 
has less transferability and requires archaeologists to survey the field, which 
is costly and time-consuming. Our machine-learning-based APM en hanced ap- 
proach has operationally useful transferability as shown in this paper. Al- 
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though APM enhanced and APM conventional have many differences, they are 
fundamentally used for finding archaeological sites. Combining both meth- 
odologies APM 1 generates even better results. Our analysis strongly sug- 
gests that combining statistical learning methods and human professional 
knowledge can bring forward fascinating outcomes. 

References 

A. P. Bradley. The use of the area under the roc curve in the evaluation of 
machine learning algorithms. Pattern recognition, 30(7):1145-1159, 1997. 

L. Chen, D. C. Comer, C. E. Priebe, D. Sussman, and J. C Tilton. Re- 
finement of a method for identifying probable archaeological sites from 
remotely sensed data. Mapping Archaeological Landscapes from Space: In 
Observance of the 40th Anniversary of the World Heritage Convention, 
2013. 

P. Chokhani. Destruction on the Public Lands: A Closer Look at Vandalism. 
1979. 

D. C. Comer. Aerial and Satellite Remote Sensing in Archaeology. Academic 
Press, 3rd edition, September 2006. 

D. C. Comer. Hydrology, human occupation, and preservation on the land- 
scape of petra. Tourism and Archaeological heritage Management at Petra: 
Driver to Development or Destruction?, 2011. 

D. C. Comer. The influence of viewshed on prehistoric archaeological site 
patterning at san clemente island as suggested by analysis of synthetic 
aperture radar images. Mapping Archaeological Landscapes from Space, 
2013. 

D. C. Comer and R. G. Blom. Detection and identification of archaeological 
sites and features using synthetic aperture radar (sar) data collected from 
airborne platforms. In J. R. Wiseman and F. El Baz, editors, Remote 
Sensing in Archaeology, pages 103-136. Springer Science + Business Media, 
LLC, New York, 2007a. 

D. C. Comer and R. G. Blom. Remote sensing and archaeology: Tracking 
the course of human history from space. Earth Img. J., Mar/Apr 2007b. 



20 



L. Devroye, L. Gyorfi, and G. Lugosi. A probabilistic theory of pattern recog- 
nition, volume 31. springer, 1996. 

R. O. Duda, P.E. Hart, and D. G. Stork. Pattern Classification. Wiley- 
Interscience, New York, 2nd edition, October 2000. 

K.L. Kvamme. Development and testing of quantitative models. Quantifying 
the Present and Predicting the Past: Theory, Methods, and Applications 
of Archaeological Predictive Modeling. US Department of Interior, Bureau 
of Land Management Service Center, Denver, Colorado, USA, pages 325- 
428, 1988. 

G. Marchisio, C. Padwick, and F. Pacifici. Evidence of Improved Vegetation 
Discrimination and Urban Mapping Using WorldView-2 Multi- Spectral Im- 
agery. Baltimore, Maryland, May 2010. 

B.H. Menze and J. A. Ur. Mapping patterns of long-term settlement in the 
near east on a large scale. Proceedings of the National Academy of Sciences 
of the United States of America, pages 109: E778-E787, 2012. 

T. Pratchett. The light fantastic. HarperTorch, 2000. 

M. O. Ruiz. The development and testing of an archaeological predictive 
model. Technical report, U.S. Army Corps of Engineers Construction En- 
gineering Research Laboratory, Champlain, Illinois, 2003. 

R. A. Schowengerdt. Remote Sensing: Models and Methods for Image Pro- 
cessing. Academic Press, 3rd edition, September 2006. 

P. N. Tan, M. Steinbach, and V. Kumar. Introduction to data mining. Pearson 
Addison Wesley, Boston, 2006. 

G.R. Willey. Prehistoric settlement patterns in the viru; valley, peru. 1953. 

L.D. Yarbrough, G. Easson, and J.S. Kuszmaul. Using at-sensor radiance 
and reflectance tasseled cap transforms applied to change detection for the 
aster sensor. In Analysis of Multi-Temporal Remote Sensing Images, 2005 
International Workshop on the, pages 141-145. IEEE, 2005. 



21 



